Temporal Relational Modeling with Self-Supervision for Action Segmentation
نویسندگان
چکیده
Temporal relational modeling in video is essential for human action understanding, such as recognition and segmentation. Although Graph Convolution Networks (GCNs) have shown promising advantages relation reasoning on many tasks, it still a challenge to apply graph convolution networks long sequences effectively. The main reason that large number of nodes (i.e., frames) makes GCNs hard capture model temporal relations videos. To tackle this problem, paper, we introduce an effective GCN module, Dilated Reasoning Module (DTGRM), designed dependencies between frames at various time spans. In particular, via constructing multi-level dilated graphs where the represent from different moments video. Moreover, enhance ability proposed model, auxiliary self-supervised task encourage module find correct wrong Our DTGRM outperforms state-of-the-art segmentation models three challenging datasets: 50Salads, Georgia Tech Egocentric Activities (GTEA), Breakfast dataset. code available https://github.com/redwang/DTGRM.
منابع مشابه
Temporal Subspace Clustering for Unsupervised Action Segmentation
Action segmentation (segmenting a continuous sequence of motion data into a set of actions) has a wide range of applications and plays a role in many problems in computer vision. We look at subspace clustering as an unsupervised approach for this task. Classical subspace clustering methods uncover relationships within the data by learning codes for the samples (i.e. frames), but in this process...
متن کاملModeling Temporal Crowd Work Quality with Limited Supervision
While recent work has shown that a worker’s performance can be more accurately modeled by temporal correlation in task performance, a fundamental challenge remains in the need for expert gold labels to evaluate a worker’s performance. To solve this problem, we explore two methods of utilizing limited gold labels, initial training and periodic updating. Furthermore, we present a novel way of lea...
متن کاملTemporal Human Action Segmentation via Dynamic Clustering
We present an effective dynamic clustering algorithm for the task of temporal human action segmentation, which has comprehensive applications such as robotics, motion analysis, and patient monitoring. Our proposed algorithm is unsupervised, fast, generic to process various types of features, and applicable in both the online and offline settings. We perform extensive experiments of processing d...
متن کاملCausal Modeling for Supervision
Today, process control and monitoring are evolving to include plant safety and availability management, on-line diagnosis and maintenance policy. Human operators are at the highest hierarchical level in the organization of the control system. Supervision aims are to assist control operators in their decisionmaking tasks, to help them understand and identify operating situations underway in a ma...
متن کاملLearning Semantic Segmentation with Diverse Supervision
Models based on deep convolutional neural networks (CNN) have significantly improved the performance of semantic segmentation. However, learning these models requires a large amount of training images with pixel-level labels, which are very costly and time-consuming to collect. In this paper, we propose a method for learning CNNbased semantic segmentation models from images with several types o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i4.16377